Skip to main content

Internet Archive Web Crawls

The Internet Archive discovers and captures web pages through many different web crawls.



rss RSS

1,818,213
RESULTS


Show sorted alphabetically

Show sorted alphabetically

SHOW DETAILS
up-solid down-solid
eye
Title
Date Reviewed
Creator
Youtube Videos
May 8, 2023 Internet Archive
web

eye 98

favorite 0

comment 1

Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl445.us.archive.org:youtube from Fri Mar 22 11:00:15 PDT 2013 to Fri Mar 22 05:43:26 PDT 2013.
favorite ( 1 reviews )
Topic: crawldata
Wide Crawl Number 16: Started June 3rd, 2017 - Still running
Apr 23, 2023 Internet Archive
web

eye 252,490

favorite 0

comment 1

Internet Archive crawldata from Webwide Crawl, captured by crawl423.us.archive.org:wide from Fri Jun 2 21:06:51 PDT 2017 to Fri Jun 2 14:32:20 PDT 2017.
favoritefavoritefavorite ( 1 reviews )
Topic: crawldata
Wide Crawl Number 17: Started August 3rd, 2018
Apr 4, 2023 Internet Archive
web

eye 7.7M

favorite 3

comment 8

Internet Archive crawldata from Webwide Crawl, captured by crawl805.us.archive.org:wide from Mon May 13 17:55:38 PDT 2019 to Mon May 13 13:45:55 PDT 2019.
favoritefavoritefavoritefavoritefavorite ( 8 reviews )
Topic: crawldata
End of Term 2020 UNT Crawls
Oct 16, 2022 University of North Texas Libraries
data

eye 10

favorite 0

comment 1

For End of Term Harvests 2020. Logs and config files of the crawls that generated WARCs with content from buildbackbetter.gov just prior to the inauguration, January 20, 2021.
favoritefavoritefavorite ( 1 reviews )
Topic: eot2020
Google Video
Dec 17, 2021 Internet Archive
web

eye 51,790

favorite 1

comment 1

Internet Archive crawldata from Google Video archiving project 2011, captured by crawl340.us.archive.org:gvdocinfo from to Wed Jul 27 15:47:37 PDT 2011.
favorite ( 1 reviews )
Topic: crawldata
Wide Crawl started February 2014
Nov 11, 2021 Internet Archive
web

eye 9.7M

favorite 0

comment 1

Internet Archive crawldata from Webwide Crawl, captured by crawl426.us.archive.org:wide from Wed Feb 19 07:58:38 PST 2014 to Wed Feb 19 05:13:46 PST 2014.
favoritefavoritefavoritefavoritefavorite ( 1 reviews )
Topic: crawldata
Youtube Videos
Oct 17, 2021 Internet Archive
web

eye 67

favorite 1

comment 1

Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl810.us.archive.org:youtube from Fri Feb 26 01:27:26 PST 2021 to Thu Feb 25 17:45:49 PST 2021.
favoritefavoritefavoritefavoritefavorite ( 1 reviews )
Topic: crawldata
Live Web Proxy Crawls
Sep 24, 2021
web

eye 152.5M

favorite 5

comment 1

favoritefavoritefavoritefavoritefavorite ( 1 reviews )
Survey Crawl Number 7
Apr 6, 2021 Internet Archive
web

eye 994,319

favorite 0

comment 1

Internet Archive crawldata from Survey Webwide Crawl, captured by crawl841.us.archive.org:survey from Sun Mar 4 08:25:03 PST 2018 to Mon Mar 5 18:20:40 PST 2018.
favoritefavoritefavoritefavoritefavorite ( 1 reviews )
Topic: crawldata
Wide Crawl Number 15: Started Oct 1st, 2016 - Ended May 8th, 2017
Jan 30, 2021 Internet Archive
web

eye 968,662

favorite 0

comment 1

Internet Archive crawldata from Webwide Crawl, captured by crawl814.us.archive.org:wide from Sat Oct 1 15:28:56 PDT 2016 to Sun Oct 2 23:53:49 PDT 2016.
( 1 reviews )
Topic: crawldata
Wide Crawl Number 14 - Started Mar 4th, 2016 - Ended Sep 15th, 2016
Jan 30, 2021 Internet Archive
web

eye 1.8M

favorite 0

comment 1

Internet Archive crawldata from Webwide Crawl, captured by crawl824.us.archive.org:wide from Thu Jun 16 19:15:17 PDT 2016 to Fri Jul 1 20:09:14 PDT 2016.
favoritefavorite ( 1 reviews )
Topic: crawldata
YouTube 2007 Crawl
web

eye 97,681

favorite 0

comment 1

This is an element of the Crawl IA-YOUTUBE-000 dataset from InternetArchive
favoritefavoritefavoritefavoritefavorite ( 1 reviews )
Youtube Videos
Nov 28, 2020 Internet Archive
web

eye 931,506

favorite 0

comment 1

Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl440.us.archive.org:youtube from Sat Jul 21 05:39:04 PDT 2012 to Fri Jul 20 23:46:43 PDT 2012.
favoritefavoritefavoritefavoritefavorite ( 1 reviews )
Topic: crawldata
Live Web Proxy Crawls
Nov 28, 2020
web

eye 47.3M

favorite 4

comment 3

favoritefavoritefavoritefavoritefavorite ( 3 reviews )
Youtube Videos
Sep 15, 2020 Internet Archive
web

eye 31

favorite 0

comment 1

Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl819.us.archive.org:youtube from Tue Sep 15 17:12:41 PDT 2020 to Tue Sep 15 10:32:19 PDT 2020.
favoritefavoritefavoritefavoritefavorite ( 1 reviews )
Topic: crawldata
Live Web Proxy Crawls
Jul 1, 2020 Internet Archive
web

eye 2.7M

favorite 0

comment 1

Internet Archive Liveweb Capture from WayBack Machine, captured by wwwb-live3.us.archive.org from 2013-06-11T17:38:46 UTC to 2013-06-11T21:24:17 UTC.
favoritefavoritefavoritefavoritefavorite ( 1 reviews )
Topic: crawldata
Wayback Indexes
web

eye 8.1M

favorite 0

comment 1

Data crawled by Internet Archive on behalf of Internet Archive from Fri Nov 01 06:23:33 PDT 2002 to Tue Nov 19 23:24:07 PDT 2002
favoritefavoritefavoritefavoritefavorite ( 1 reviews )
Topic: crawldata
Youtube Videos
Aug 1, 2019 Internet Archive
web

eye 46

favorite 0

comment 1

Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl823.us.archive.org:youtube from Thu Aug 1 06:21:17 PDT 2019 to Wed Jul 31 23:31:25 PDT 2019.
favoritefavoritefavoritefavoritefavorite ( 1 reviews )
Topic: crawldata
Youtube Videos
Jul 4, 2019 Internet Archive
web

eye 41

favorite 0

comment 1

Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl821.us.archive.org:youtube from Mon Feb 11 12:34:17 PST 2019 to Mon Feb 11 04:48:30 PST 2019.
favoritefavoritefavoritefavoritefavorite ( 1 reviews )
Topic: crawldata
Live Web Proxy Crawls
Apr 4, 2019 Internet Archive
web

eye 22.4M

favorite 8

comment 1

Internet Archive Liveweb Capture from WaybackMachine, captured by wwwb-proxy0.us.archive.org:wbm from Sun Mar 27 22:10:09 PDT 2011 to Mon Mar 28 05:27:05 PDT 2011.
favoritefavoritefavoritefavoritefavorite ( 1 reviews )
Topic: crawldata
Live Web Proxy Crawls
Mar 9, 2019
web

eye 74,065

favorite 0

comment 1

favoritefavoritefavorite ( 1 reviews )
Live Web Proxy Crawls
Dec 13, 2018 Internet Archive
web

eye 1.8M

favorite 0

comment 2

Internet Archive Liveweb Capture from WayBack Machine, captured by wwwb-live3.us.archive.org from 2013-08-14T05:36:46 UTC to 2013-08-14T16:31:43 UTC.
favoritefavorite ( 2 reviews )
Topic: crawldata
google ngrams
Dec 26, 2013 Google, Inc.
web

eye 28

favorite 0

comment 1

This item contains the Google ngram data for the American English languageset. ​​​​​ Here are the datasets backing the Google Books Ngram Viewer. These datasets were generated in July 2009; we will update these datasets as our book scanning continues, and the updated versions will have distinct and persistent version identifiers (20090715 for the current set). Each of the numbered links below will directly download a fragment of the given corpus. For instance, the first ten links...
favoritefavoritefavoritefavoritefavorite ( 1 reviews )
Live Web Proxy Crawls
Aug 6, 2013 Internet Archive
web

eye 1.8M

favorite 0

comment 1

Internet Archive Liveweb Capture from WayBack Machine, captured by wwwb-live4.us.archive.org from 2013-07-13T11:35:01 UTC to 2013-07-14T12:32:30 UTC.
favoritefavoritefavoritefavoritefavorite ( 1 reviews )
Topic: crawldata
Youtube Videos
web

eye 9

favorite 0

comment 0

Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl838.us.archive.org:youtube from Tue Jan 5 16:32:45 PST 2021 to Tue Jan 5 09:24:54 PST 2021.
Topic: crawldata
Youtube Videos
web

eye 4

favorite 0

comment 0

Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl860.us.archive.org:youtube from Thu Dec 10 05:41:12 PST 2020 to Wed Dec 9 21:44:01 PST 2020.
Topic: crawldata
Youtube Videos
web

eye 6

favorite 0

comment 0

Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl858.us.archive.org:youtube from Sun Jan 17 15:11:29 PST 2021 to Sun Jan 17 08:23:32 PST 2021.
Topic: crawldata
Youtube Videos
web

eye 8

favorite 0

comment 0

Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl817.us.archive.org:youtube from Wed May 5 03:53:01 PDT 2021 to Tue May 4 23:00:35 PDT 2021.
Topic: crawldata
Youtube Videos
web

eye 8

favorite 0

comment 0

Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl815.us.archive.org:youtube from Tue Jan 12 19:09:50 PST 2021 to Tue Jan 12 11:32:48 PST 2021.
Topic: crawldata
Youtube Videos
web

eye 6

favorite 0

comment 0

Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl815.us.archive.org:youtube from Thu Feb 18 11:50:43 PST 2021 to Thu Feb 18 04:05:39 PST 2021.
Topic: crawldata
Topics: crawl, logs
Youtube Videos
web

eye 8

favorite 0

comment 0

Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl819.us.archive.org:youtube from Wed Feb 10 21:46:27 PST 2021 to Wed Feb 10 15:13:33 PST 2021.
Topic: crawldata
Youtube Videos
web

eye 8

favorite 0

comment 0

Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl820.us.archive.org:youtube from Tue Jan 26 15:07:12 PST 2021 to Tue Jan 26 08:19:12 PST 2021.
Topic: crawldata
Youtube Videos
web

eye 6

favorite 0

comment 0

Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl819.us.archive.org:youtube from Fri Jan 1 01:25:55 PST 2021 to Thu Dec 31 17:34:56 PST 2020.
Topic: crawldata
Host Screen Captures
web

eye 13

favorite 0

comment 0

Internet Archive crawldata from Webwide Crawl, captured by crawl446.us.archive.org:widewebcap from Sat May 28 10:19:52 PDT 2016 to Sat May 28 03:21:34 PDT 2016.
Topic: crawldata
Youtube Videos
web

eye 2

favorite 0

comment 0

Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl110.us.archive.org:youtube from Wed Aug 5 21:19:34 PDT 2020 to Wed Aug 5 14:43:36 PDT 2020.
Topic: crawldata
crawl_UNK
web

eye 16

favorite 0

comment 0

This is an element of the Crawl UNK dataset from Alexa Internet
Youtube Videos
web

eye 6

favorite 0

comment 0

Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl858.us.archive.org:youtube from Fri Dec 25 02:03:34 PST 2020 to Thu Dec 24 18:21:22 PST 2020.
Topic: crawldata
Youtube Videos
web

eye 5

favorite 0

comment 0

Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl820.us.archive.org:youtube from Sun Feb 14 09:08:44 PST 2021 to Sun Feb 14 01:18:46 PST 2021.
Topic: crawldata
Youtube Videos
web

eye 6

favorite 0

comment 0

Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl818.us.archive.org:youtube from Thu Feb 18 05:13:31 PST 2021 to Wed Feb 17 21:24:01 PST 2021.
Topic: crawldata
google ngrams
- Google, Inc.
web

eye 101

favorite 0

comment 0

This item contains the Google ngram data for the Hebrew languageset. ​​​​​ Here are the datasets backing the Google Books Ngram Viewer. These datasets were generated in July 2009; we will update these datasets as our book scanning continues, and the updated versions will have distinct and persistent version identifiers (20090715 for the current set). Each of the numbered links below will directly download a fragment of the given corpus. For instance, the first ten links below...
Topics: crawl, logs
Youtube Videos
web

eye 9

favorite 0

comment 0

Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl815.us.archive.org:youtube from Sun Feb 14 14:37:36 PST 2021 to Sun Feb 14 06:47:25 PST 2021.
Topic: crawldata
Youtube Videos
web

eye 5

favorite 0

comment 0

Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl817.us.archive.org:youtube from Fri Jun 4 10:35:15 PDT 2021 to Fri Jun 4 03:48:35 PDT 2021.
Topic: crawldata
Youtube Videos
web

eye 10

favorite 0

comment 0

Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl815.us.archive.org:youtube from Sat Jan 9 02:40:56 PST 2021 to Fri Jan 8 18:51:13 PST 2021.
Topic: crawldata
crawl_UNK
web

eye 17

favorite 0

comment 0

This is an element of the Crawl UNK dataset from Alexa Internet
Youtube Videos
web

eye 6

favorite 0

comment 0

Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl819.us.archive.org:youtube from Sun Jan 17 14:13:27 PST 2021 to Sun Jan 17 08:52:37 PST 2021.
Topic: crawldata
Youtube Videos
web

eye 7

favorite 0

comment 0

Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl820.us.archive.org:youtube from Sat Jan 16 06:59:47 PST 2021 to Fri Jan 15 23:25:11 PST 2021.
Topic: crawldata
Youtube Videos
web

eye 7

favorite 0

comment 0

Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl860.us.archive.org:youtube from Sun Feb 28 02:42:40 PST 2021 to Sat Feb 27 19:12:54 PST 2021.
Topic: crawldata
Youtube Videos
web

eye 7

favorite 0

comment 0

Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl817.us.archive.org:youtube from Mon Jan 11 14:44:50 PST 2021 to Mon Jan 11 06:52:46 PST 2021.
Topic: crawldata
Youtube Videos
web

eye 6

favorite 0

comment 0

Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl818.us.archive.org:youtube from Wed Feb 17 08:09:33 PST 2021 to Wed Feb 17 00:24:31 PST 2021.
Topic: crawldata
Youtube Videos
web

eye 26

favorite 0

comment 0

Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl858.us.archive.org:youtube from Sat Aug 29 21:29:54 PDT 2020 to Sat Aug 29 14:46:21 PDT 2020.
Topic: crawldata
Youtube Videos
web

eye 4

favorite 0

comment 0

Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl819.us.archive.org:youtube from Thu Jun 17 03:34:21 PDT 2021 to Wed Jun 16 21:00:53 PDT 2021.
Topic: crawldata
crawl_UNK
web

eye 28

favorite 0

comment 0

This is an element of the Crawl UNK dataset from Alexa Internet
Youtube Videos
web

eye 7

favorite 0

comment 0

Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl817.us.archive.org:youtube from Sat Jan 9 19:56:18 PST 2021 to Sat Jan 9 12:05:24 PST 2021.
Topic: crawldata
Youtube Videos
web

eye 6

favorite 0

comment 0

Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl819.us.archive.org:youtube from Tue Jan 19 19:37:12 PST 2021 to Tue Jan 19 12:31:33 PST 2021.
Topic: crawldata
Youtube Videos
web

eye 5

favorite 0

comment 0

Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl823.us.archive.org:youtube from Wed Jan 27 14:50:30 PST 2021 to Wed Jan 27 07:10:58 PST 2021.
Topic: crawldata
Youtube Videos
web

eye 7

favorite 0

comment 0

Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl820.us.archive.org:youtube from Sun Feb 14 19:20:15 PST 2021 to Sun Feb 14 11:29:04 PST 2021.
Topic: crawldata
Youtube Videos
web

eye 5

favorite 0

comment 0

Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl818.us.archive.org:youtube from Wed Feb 10 21:06:34 PST 2021 to Wed Feb 10 13:23:19 PST 2021.
Topic: crawldata
Youtube Videos
web

eye 3

favorite 0

comment 0

Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl837.us.archive.org:youtube from Thu Oct 22 19:29:37 PDT 2020 to Thu Oct 22 12:41:42 PDT 2020.
Topic: crawldata
Youtube Videos
web

eye 6

favorite 0

comment 0

Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl815.us.archive.org:youtube from Tue Jan 26 02:01:51 PST 2021 to Mon Jan 25 18:28:54 PST 2021.
Topic: crawldata
Youtube Videos
web

eye 4

favorite 0

comment 0

Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl819.us.archive.org:youtube from Sat Jan 2 13:11:57 PST 2021 to Sat Jan 2 05:18:42 PST 2021.
Topic: crawldata
Youtube Videos
web

eye 4

favorite 0

comment 0

Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl858.us.archive.org:youtube from Mon Oct 19 09:44:15 PDT 2020 to Mon Oct 19 02:59:11 PDT 2020.
Topic: crawldata
Youtube Videos
web

eye 5

favorite 0

comment 0

Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl822.us.archive.org:youtube from Sat Jan 9 09:10:58 PST 2021 to Sat Jan 9 01:45:59 PST 2021.
Topic: crawldata
Youtube Videos
web

eye 6

favorite 0

comment 0

Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl818.us.archive.org:youtube from Mon Jan 11 08:52:46 PST 2021 to Mon Jan 11 01:00:43 PST 2021.
Topic: crawldata
Youtube Videos
web

eye 8

favorite 0

comment 0

Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl819.us.archive.org:youtube from Thu Jan 14 03:41:32 PST 2021 to Wed Jan 13 20:44:02 PST 2021.
Topic: crawldata
Youtube Videos
web

eye 6

favorite 0

comment 0

Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl821.us.archive.org:youtube from Sun Dec 20 11:01:34 PST 2020 to Sun Dec 20 04:15:34 PST 2020.
Topic: crawldata
Youtube Videos
web

eye 6

favorite 0

comment 0

Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl817.us.archive.org:youtube from Thu Jan 7 10:54:42 PST 2021 to Thu Jan 7 03:02:42 PST 2021.
Topic: crawldata
Youtube Videos
web

eye 3

favorite 0

comment 0

Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl824.us.archive.org:youtube from Thu Oct 22 19:12:43 PDT 2020 to Thu Oct 22 12:27:15 PDT 2020.
Topic: crawldata
Youtube Videos
web

eye 7

favorite 0

comment 0

Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl815.us.archive.org:youtube from Sun Feb 21 13:18:23 PST 2021 to Sun Feb 21 05:36:14 PST 2021.
Topic: crawldata
Youtube Videos
web

eye 7

favorite 0

comment 0

Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl857.us.archive.org:youtube from Mon Feb 1 08:16:19 PST 2021 to Mon Feb 1 00:45:28 PST 2021.
Topic: crawldata
Youtube Videos
web

eye 4

favorite 0

comment 0

Internet Archive crawldata from YouTube Video archiving project 2011, captured by crawl860.us.archive.org:youtube from Fri Sep 25 18:09:43 PDT 2020 to Fri Sep 25 11:20:42 PDT 2020.
Topic: crawldata